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We present, and analyze thoroughly, a highly symmetric and efficient scheme for the determination of a 
single-qubit state, such as the polarization properties of photons emitted by a single-photon source. In our 
scheme there are only four measured probabilities, just enough for the determination of the three parameters 
that specify the qubit state, whereas the standard procedure would measure six probabilities. 
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I. INTRODUCTION 

Experiments that exploit the polarization degree of freedom 
of single photons, detected one by one, have become an al- 
most routine matter in recent years. In particular, a whole 
class of experiments that demonstrate the technical feasibil- 
ity of quantum cryptography, or quantum key distribution, use 
the photon polarization as the carrier of the quantum bit, or 
qubit. Other experiments make use of a spatial degree of free- 
dom, essentially the path qubit of a two-path interferometer, 
which is sometimes translated into the alternative of early or 
late arrival for the sake of easier transmission. 

In applications like these, as well as many others, one must 
be able to characterize the qubit source and the transmission 
channel. For this purpose a complete determination of the 
state of the qubit is required, both as it is emitted from the 
source and as it arrives after transmission. To be able to per- 
form the regular on-the-fiy calibration of the setup, so as to 
compensate for the unavoidable drifts, one needs an efficient 
diagnostics that does not consume more qubits than really nec- 
essary. 

The standard procedure measures three orthogonal compo- 
nents of the relevant qubit analog of Pauli's spin vector opera- 
tor, so that six probabilities are estimated for the determination 
of the three real parameters that specify the qubit state. But 
clearly, four measured probabilities should suffice to establish 
the values of three parameters. Indeed, such minimal schemes 
for state determination are possible, and it is the objective of 
this paper to present and analyze one such scheme, a highly 
symmetric one. 

In Sec. [n| we briefly review the standard six-output mea- 
surement scheme and then introduce the minimal four-output 
scheme, followed by remarks on state determination for qubit 
pairs. We then proceed to describe, in Sec. [HI] optical im- 
plementations for the measurement of a photon's polarization 
qubit — polarimeters or ellipsometers in the jargon of classi- 
cal optics. 

The question of how one infers a reliable estimate for the 
qubit state after the detection of a finite, possibly small, num- 
ber of qubits is addressed in Sec. lIVI After discussing the opti- 
mality of the highly symmetric four-output scheme in Sec.lVl 
and remarking on some peculiar aspects of measuring pure 
qubit states in Sec. IVII we analyze adaptive measurement 
strategies in Sec. lVllI and then close with a summary. 



II. QUBIT TOMOGRAPHY 

A. Standard six-state tomography 

We describe, as usual, the binary quantum alternative of 
the qubit by a Pauli vector operator a = (<r x , a y ,a z ). The 
physical nature of the qubit is irrelevant for the present gen- 
eral discussion — it might just as well be the spin-i degree 
of freedom of an electron, or a pseudo-spin such as the path 
degree of freedom in a two-path interferometer or the internal 
degree of freedom of a two-level atom — but in the particular 
application that we have in mind it is the polarization degree 
of freedom of a photon. Then we use the convention specified 
by 

a x = |h)(v| + |v)(h| , 

o-y = i (|h)(v| - |v)(h|) , 

a z = |v)(v|-|h}(h|, (2.1) 

where | v) and | h) are the ket vectors for vertical and horizon- 
tal polarization, respectively. 

The statistical operator of the qubit emitted by a given 
source, 



s ■ a 



(2.2) 



is parameterized by the Pauli vector s = (<?) = trjcrp}, the 
expectation value of a. The positivity of p restricts the Pauli 
vectors to the Bloch ball, s = si < 1. The experimental 
characterization of the source requires, therefore, a complete 
measurement of s with sufficient precision. Any procedure 
that can yield this information is an example of qubit tomog- 
raphy. 

In the standard approach one measures a x for some qubits 
supplied by the source, a y for some others, and a z for yet oth- 
ers. Assuming an unbiased procedure, that is for each qubit 
there is an equal chance for either one of the three measure- 
ments to happen, there are six possible outcomes that occur 
with the probabilities 



;P C± ) for Z = X) y,z. (2.3) 
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Each operator P^± is a third of a projector, and since these 
nonnegative operators decompose the identity, 

£ (P e+ +P £ _)=l, (2.4) 

they constitute the Positive Operator Valued Measure 
(POVM) of this standard six-state tomography. 

This POVM is an example of a tomographically complete 
set of measurements of pairwise complementary observables, 
namely a x , a y , and a z , so that their eigenstates constitute sets 
of mutually unbiased bases. As Wootters and Fields have 
shown |1], such sets are particularly well suited for tomo- 
graphic purposes, inasmuch as the statistical errors in the es- 
timates based on a finite number of measurements are mini- 
mal. The sets themselves are not of minimal size, however, 
because one measures six probabilities to determine three pa- 
rameters, the components of the Pauli vector s. Indeed, the six 
probabilities of Eq. d2.3t are subject to the three constraints 
p£ + + = |, £ = x, y, z, rather than to the single con- 
straint of unit sum. A minimal POVM, by contrast, would 
refer to only four outcomes and their probabilities, with unit 
sum as the only constraint. 

B. Minimal four-state tomography 

We construct such a minimal POVM of high internal sym- 
metry by first choosing four unit vectors, Si, 04, with 
equal angle between each pair, 

- - 4 1 f lforj = fc, .... 
a, • a k = -6 jk - - = I _ 1/3 for . ^ k (2.5) 

Geometrically speaking, such a quartet consists of the vectors 
pointing from the center of a cube to non-adjacent corners, as 
illustrated in Fig. Hand exemplified by 

Si = 3- 1 /2(i )1) i) ) 

a 2 = 3- 1 / 2 (l,-l,-l) , 

S 3 = 3^ 2 (-1,1,-1), 

cU = 3- 1 ' a (-l,-l,l). (2.6) 

Alternatively, one may picture these vectors as the normal 
vectors for the faces of the tetrahedron that is defined by the 
other four corner of the cube. 

The linear dependence of the a^'s is stated by their null 
sum, 

4 

53^=0, (2.7) 

i=i 

and their completeness by the decomposition of the unit 
dyadic, 

3 4 

4 S & = 1 ■ (2 ' 8) 




FIG. 1: Picturing the vector quartet of Eq. 12.51 as pointing from 
the center to four non-adjacent corners of a cube, or as the vectors 
normal on the faces of a tetrahedron. 

The perfect symmetry of the tetrahedron geometry manifests 
itself in the simplicity of this completeness relation and the 
inner products of Eq. J2.5I . As discussed in Sec. lV Bl below. 
the tetrahedron geometry is optimal in the sense that any other 
vector quartet would define a less efficient scheme for four- 
state tomography. 

Each such quartet of Sj 's defines a POVM for minimal four- 
state tomography in accordance with 

4 1 
Y Pj ■ = 1 with P 3 =-(l + Sj-S). (2.9) 

i=i 

This POVM is an example of a "symmetric informationally 
complete POVM" |2]. Upon measuring it and so determining 
the probabilities ||3|] 

p j = (P j ) = \(l + S j -s) , (2.10) 
the Pauli vector is readily available, 

8 = S^pjSj , (2.11) 

3 

and so are the statistical operator and its square, 

P = 6^p J P i -l=^(P i )(6P i -l) ) 

3 3 

p 2 = P -1 + 3J2p 2 3- (2-12) 

3 

It follows that, in addition to being restricted to the range < 
Pj < |, the probabilities pj obey the inequalities 

\<-^-Hf<-\- <-> 

3 

The upper bound is reached by all pure states, p — p 2 and 
s = 1, the lower bound for the completely mixed state, p = \ 
and a = 0. 



C. Qubit-pair tomography 

The minimal property of the four-state POVM of Eq. j2.9t 
carries over to multi-qubit states. In case of n qubits, one has 



3 



4" joint probabilities for the 4™ — 1 independent parameters 
of the 2™ x 2™ matrix elements of the statistical operator, so 
that the count is just right. By contrast, if one were to mea- 
sure n realizations of the six-state POVM of Eq. (12. 41 . one 
would have 6™ joint probabilities which contain quite a lot of 
redundant information. 

More specifically, consider the n = 2 situation of a source 
emitting qubit pairs. Using the Pj's from above for one qubit 
and corresponding operators Qk for the other, we obtain the 
16 joint probabilities (PjQk) by measuring the two four-state 
POVMs. They are the numerical ingredients in 



]T (6P 3 - 1) {PjQk) (6Q fc - 1) 



(2.14) 



j • k 



the explicit construction of the two-qubit statistical operator. 
If there are no correlations in the joint probabilities, so that 
(PjQk)=(Pj)(Qk), then this p is the product of two factors of 
the one-qubit form in Eq. J2. 12i . as it should be. The general- 
ization of the n = 2 expression d2 . 1 4t to n > 2 is immediate. 

Qubit-pair tomography of this kind requires that the vector 
quartet bk associated with the Qfc's has a known orientation 
relative to the quartet aj of the PjS. One can determine this 
orientation by "quantum measurement tomography" 0], that 
is by measuring the joint probabilities (PjQk) for a source 
with a known output |5]. In the simplest situation, for exam- 
ple, the source emits pairs that are perfectly anticorrelated, 



- ( 1 - <?« • 



(2.15) 



The orthogonal dyadic O that turns the Sj quartet and the bk 
quartet into each other, 



b k = ■ a k , a, =bj O, 



(2.16) 



is then given by 



O = -9 £ aj (PjQ k ) a k = -9j2 % (PjQk) h , (2. 17) 

j,k j,k 

and can thus be determined experimentally. 



III. ELLIPSOMETRY 

A. Standard six-outcome ellipsometer 

Devices for characterizing the polarization properties of a 
light source are called ellipsometers, a term that makes refer- 
ence to elliptic polarization as the generic outcome of the mea- 
surement. Figure |2 shows the schematic setup of a standard 
six-state ellipsometer, which realizes the POVM of Eqs. d2.3i 
and ( 12. 4i for the photonic polarization qubit. The input beam 
splitter (BS) reflects one third of the light to a polarizing BS 
(PBS) that reflects vertically polarized photons, and transmits 
horizontally polarized ones, and so directs them to two pho- 
todetectors. This branch thus realizes a measurement of <r z 
and accounts for P z+ and P z _ in the sum of Eq. (12. 4\ . 
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FIG. 2: Schematic setup of an ellipsometer that implements the stan- 
dard six-state POVM; see text. 



Two thirds of the light are transmitted at the input BS and 
are then equally split at a second BS. A photon transmitted 
there will be detected by either one of the two detectors behind 
another PBS. This PBS is rotated by 45°, so that a measure- 
ment of a x is realized in this branch and the term P x+ + P x - 
is accounted for in Eq. i2.4\ . 

Finally, the photons that are reflected at the second BS pass 
through a quarter-wave plate (QWP) before a PBS directs 
them to a third pair of detectors. This branch implements the 
measurement of <j y and accounts for the remaining terms in 
Eq. \2.4\ . namely P y+ and P y ~. 



B. Minimal four-outcome ellipsometer 

There are a number of alternative schemes for an ellipsome- 
ter that realizes the minimal four-state POVM of Eq. \2.9\ . To 
demonstrate the case, we present one scheme here and discuss 
a few other schemes, which are much simpler and much more 
practical, elsewhere |6j,|7J. 

The principle of one minimal ellipsometer is illustrated by 
Fig-Efa). It is an asymmetric four-path interferometer. At the 
input, one half of the light intensity is directed into the up- 
permost path for reference, whereas each of the other three 
paths gets one sixth of the intensity. Photons in the lower 
paths pass through wave plates that realize the unitary polar- 
ization transformations p — > cr^pcr^ with £ = x, or y, or z, 
respectively. Then all four paths are recombined by a bal- 
anced beam merger, which distributes the intensity of each 
input evenly among the four outputs. An unpolarized photon 
has, therefore, a probability of 25% for being detected by a 
particular one of the four detectors. The probabilities for a 
polarized photon are the pj 's of Eq. (12. 101 . so that the polar- 
ization POVM of Eq. ( 12. 9t is measured indeed. 

An optical network for a four-path interferometer of this 
kind is shown in Fig.|3jb). The BS at the input reflects 4/6 of 
the intensity and transmits 2/6, and the subsequent BSs either 
split the beam 3 : 1 or 1 : 1; together these three BSs imple- 
ment the initial stage of Fig. [2 a). At the central stage, there 
are wave plates in three partial beams (labeled by a x , a y , and 
a z , respectively). And the final beam merger consists of four 



(a) 



(b) 




FIG. 3: (a) Principle of a four-path interferometer for minimal ellip- 
sometry; (b) Optical network for the implementation; see text. 



1:1 BSs which direct the photons to the four photodetectors 
ft. 

The interferometer of Fig. |3jb) has several loops. Some al- 
ternative setups have two loops |9], or need a single loop only, 
among them the interferometer of the experiment by Clarke et 
al. fioll and the minimal one-loop scheme of Ref. 0, and 
yet another setup has no loop at all 0]. We note that Clarke 
et al. did not perform ellipsometry, their experiment served a 
different purpose and, although their setup could be used for 
ellipsometry, there is no mentioning of this possible applica- 
tion in Ref. Iiot . 

It is worth mentioning that such a setup can also be viewed 
as a quantum computation network for three qubits, one being 
the polarization qubit of interest, the other two qubits repre- 
senting the four paths of the interferometer jTlll . The network 
is depicted in Fig.|4] It consists of a sequence of generalized 
Hadamard gates, 




(3.1) 



At the first stage, the auxiliary qubits are prepared in a super- 
position state that has amplitude 1 / V2 for 1 00) = | vv) and am- 
plitudes 1/a/6 for |0l), 1 10), and | ll) = | hh). We achieve this 
by a controlled gate with = hit (this is a controlled-not gate, 
in fact) that is sandwiched by a gate with <j> — a and two gates 



tt/2- (3 



-tt/2 



tt/4 



tt/4 



FIG. 4: Quantum computation network for minimal qubit tomogra- 
phy. The bottom qubit is the one of interest, it enters in the state p 
whose properties are to be determined. The two top qubits are aux- 
iliary qubits that enter in state of the computational basis and are 
eventually measured. The probabilities for the four different mea- 
surement results — 00, 01, 10, and 11 — are the pj's of Eq. 12. 101 
with the Sj's of Eq. 12.61 . The network uses eight gates of the gen- 
eralized Hadamard type 13. It . three of them controlled by one of the 
auxiliary qubits, and a controlled phase gate. 



with <j) = (3, where a and (3 are such that sin(2a) = -| (\/3— 1) 
andtan(2/3) = \/3 + 1. 

The central stage has controlled gates acting on the qubit of 
interest, a gate with <f> = for a z , another one with <f> = ^tt 
for a x . For their product to realize cr y = ia x cr z , we provide 
the factor of i by a subsequent controlled phase gate, which 
implements the phase change 1 1 1 ) — > i 1 1 l), but has no effect 
on |00), 1 01), and 1 10). 

At the final stage, the two auxiliary qubits are passed 
through standard Hadamard gates, for which <j) = jir in 
Eq. ( 13. U . and are then measured in the computational basis. 
The probabilities for getting 00, 01, 10, or 11 are pi, p±, p2, 
and P3 of Eqs. J2. lQi and J2.6I >. respectively. 

Conditioned on the measurement outcome, the qubit of in- 
terest emerges in the corresponding reduced state 2PjpPj /pj. 
For the optical implementation of Fig. [2b), these final states 
of the polarization qubit are fictitious, however, unless the 
photodetectors are of a fantastic non-demolition kind: sen- 
sitive to the passage of a photon without absorbing the photon 
or affecting its polarization | ljl- 



IV. COUNTING QUBITS 
A. Maximum-likelihood estimator 

In an actual experiment, we do not measure the proba- 
bilities of Eqs. d2.3l > or J2.10> . but rather relative frequen- 
cies that are statistically determined by these probabilities. 
The available information consists of the counts of detector 
clicks, n\, ri2, n.3, «4 for the minimal four-state tomogra- 
phy of Sec. Ill Bl and n x ±, n y ±, n z ± for the standard six-state 
scheme of Sec. Ill Al In what follows, we focus on the novel 
four-detector situation. 

In view of the intrinsic probabilistic nature of quantum phe- 
nomena, a given total number of N qubits does not result in a 
definite, predictable number of clicks for each detector. It is, 
therefore, clear that an observed break up, 



N = ni + n 2 + n 3 + n 4 , 



(4.1) 
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FIG. 5: The likelihood function as a probability density for the Pauli 
vector s. The plot shows the "likelihood clouds" for simulated mea- 
surements for which the most likely Pauli vector has length s = 0.84 
and is pointing toward one of the corners of the cube of Fig.Q A to- 
tal of 100 qubits have been detected for the bottom right corner, 200 
for the bottom left corner, 400 for the top rear corner, and 800 for 
the top front corner. The successive shrinking of the cloud is clearly 
visible. 

is consistent with not just one Pauli vector s in Eq. (I2.2L but 
with many. One expects that, as a rule, the obvious guess that 
obtains from Eq. (I2.1H upon the replacement pj — > Uj/N, 
namely 

3 

gives a reliable estimate of the true Pauli vector s. But it may 
happen that the length of this inferred S exceeds unity, and 
this is in fact a typical situation if the true s is of unit length, 
that is the source emits qubits in a pure state. 

Let us thus proceed to show how one infers a plausible 
and physically correct answer on the basis of the registered 
data. The experimental data consists in fact not just of the 
total counts rij in the break up ( I4.lt . but of a particular se- 
quence of detector clicks. Provided that the source emits 
qubits in the state specified by the Pauli vector s, with no sta- 
tistical correlations between different qubits, the probability 
£(s; m, . . . , rc.4) for getting the observed click sequence is, 
therefore, given by 

4 

C = Y[ P] (^, (4.3) 

i=i 

where Pj(s) is the probability (12 . 1 Oi that a qubit is registered 
by the jth detector. 

Conversely, in the spirit of the Bayesian principle of statis- 
tical inference, we can regard C(s; nj., . . . , rc.4) as the likeli- 
hood that the source is characterized by s, given that a click 
sequence with total detector counts m, . . . , n.4 was observed. 
When many qubits have been counted, the likelihood function 



is sharply peaked and essentially vanishes outside the imme- 
diate vicinity of its maximum. These matters are illustrated 
by the four "likelihood clouds" in Fig.|5] 

The maximum-likelihood (ML) estimator S picks out the 
most likely Pauli vector, the one for which C is largest, 

max£(s>£(S). (4.4) 

s 

For the purposes of this paper, we accept this S as our plau- 
sible guess for s, while being fully aware of other strategies 

EG1E1. 

An important theorem by Fisher states that ML estimators 
become efficient in the large-iV limit flrjl fl7ll . So the per- 
formance, for large N, of an experimental tomographic setup 
can be quantified by the accuracy of the ML estimator. Usu- 
ally an analytical expression for the ML estimator is not avail- 
able and one has to solve a nonlinear operator equation for 
p = i ( 1 + S- (?) to find it lll8ll . In the present context, however, 
the high symmetry of the vector quartet of Eq. d2.5i makes it 
possible to simplify this problem considerably. 

There is a benefit in maximizing the right-hand side of 
Eq. i4.2\ with respect to the probabilities pj rather than with 
respect to s. Lagrange multipliers are used to account for the 
two constraints. One is the unit sum of the pj, J2j Pj = 1> that 
is the unit trace of p, the other is the positivity of p, p > 0, 
which is the upper bound in Eq. ( 12. 1 31 . 

Without this positivity constraint, the likelihood J4.3I 
would be maximized b y p 7 = i/j, which in turn would im- 
ply the Pauli vector of (14.21 . But, if the actual counts violate 
the inequality 

3 

this simple estimation fails to provide a physically meaningful 
result. When this happens, both constraints must be taken into 
account and the likelihood £(s) attains its maximum on the 
boundary of the set of qubit states, that is for a Pauli vector of 
unit length fisfll . 

The variation of the likelihood vanishes at the extremal 
point, 

(Slog£ = X]— <*Pj=0. (4.6) 

3 Pl 

The variations Spj are subject to the two constraints 

£<Jp,-=0, 5>^p,- = 0, (4.7) 

3 3 

for which we use the Lagrange multipliers NX and 3 Np, re- 
spectively. Denoting the P j values at the extremal point by §j, 
( 14. 6> then implies 

^- = \ + 3pPi for j = l,...,4. (4.8) 

P3 

We exploit J2j u j = 1> J2 P3 = 1. and J2j p] = \ t0 estab- 
lish 
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and 



1 = A + /i , 



(4.10) 



the first by summing the four equations in (14. 8> . the second by 
summing them after multiplication with pj . 

Taken together, Eqs. J4.8i — (14. IQi make up a set of six equa- 
tions for the six unknowns: p\, . . . , p~4, A, and \i. We solve the 
quadratic equations J4.8I for pj, or rather Vj/pj, 



11 

Pj 



A 



A 2 + 12//ZA 



(4.11) 



and J4. IQi for A and substitute these into Eq. ( 14.91 to arrive at 
a single equation for fi, 



2-^/(W^TT2^=0. 



(4.12) 



There is always the solution /i = 0, and thus A = 1, but ( 14. lit 
amounts to ( 14.21 for these values, and therefore this solution 
is not acceptable, when the inequality (14.51 is not obeyed, as 
is the case in the present discussion. Upon recognizing that 
Eq. ( 14.121 can also be written as 



(3i/j - l)vj 



— fl + &lWj + — /i) 2 + 12^1 



= (4.13) 



we can discard the unphysical solution /i = and find the 
relevant solution in the range < \l < 2 as the root of this 
sum over j. The upper bound \i = 2 is reached when all qubits 
are detected by the same detector, so that one of the Vj equals 
1 and the others vanish. 

In summary, we determine the ML estimator 



Pj' 



(4.14) 



as follows. If the inequality (I4.5i is obeyed by Vj — rij/N, 
we take pj — Vj . Otherwise we find /i as the positive root 
of ( 14.121 or d4. 1 3t and then get the four pj 's from 14. Ill with 



A : 
Pj 



\x. In the extremal situation of Vj 
\5j k and S = a k . 



Ojk- 



we have 



B. Many detector clicks 

In this procedure for finding the ML estimator, there is a 
crucial difference between relative frequencies i/j that obey 
the inequality ( 14. 51 ). and can therefore serve as probabilities, 
and those that violate the inequality. When the total number 
N of detected qubits is small, statistical fluctuations are rel- 
atively large and a violation is hardly surprising. But what 
is the typical situation for a large number of detector clicks, 
should we expect (14.51 to be obeyed or violated? 

For the likelihood J4.3I . only the break-up (14. II matters, 
not the particular sequence of detector clicks. There are 
i\n/(ni!n2!n 3 !n4!) different sequences for a given break-up, 
so that there is a multinomial statistics for the probability of 



getting a particular break-up for the given probabilities pj (s) 
of Eq. ( I2.10> . We denote the corresponding averages over pos- 
sible break-ups by over-bars, as illustrated by 



■ = NVJ=Npi (4.15) 



and 



VjV k = p,jp k + — (S jk p k - pjp k ) . (4.16) 

Upon recalling that £\ p] = (3 + s 2 ) j\% cf. Eq. Ol . the 
latter averages imply 



i 

N 
1 

3 " 



N-l 



N 



1-s 2 



T,PJ 

j 



12 



UN 



(4.17) 



so that, on average, the inequality J4.5I is violated for pure 
states (for which s = 1), and is obeyed for mixed states (for 
which s < 1). We thus expect that the detector counts for a 
pure qubit state will typically violate inequality ( 14.51 . whereas 
the counts for a mixed state will tend to obey it. 

As a more precise statement about this matter, we note that 
the fraction of detector click sequences that violate the in- 
equality is 



prob(violation) = - + -erf ( — — ( V" (4.18) 

2 2 \ 2/^ V 



where erf ( ) is the standard error function and k is given by 

3 j 

Equation ( 14.181 applies for ^> 1; in order to derive it, first 
observe that the central limit theorem ensures that 



j 

= ex p(~2jyE a o ( s jkPk - PjPk)ak) (4.20) 

for large N [2Q]. Then use this generating function for the 
mean values of products of the Uj's to calculate the probability 
that£ 



> 



with consistent approximations for N ^> 1. 
For pure states, the argument of the error function in (14.181 
is 1/(3kV~N), which is always positive, but decreases with 
growing N, so that click sequences that violate the inequality 
are more frequent than the ones obeying it, but they are not 
much more frequent. A remarkable exception is the situation 
of the Pauli vector s being exactly opposite to one of the direc- 
tions of the vector quartet of Eqs. ( 12.51 and ( 12.61 . Then one of 
the pj's vanishes and the other three are all equal to |, so that 
k = and there is a unit probability for getting a violation. 
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For mixed states, the argument of the error function is neg- 
ative and increases in magnitude oc y/N (more about this in 
Sec. IV Cb . Accordingly, a violation of the inequality is highly 
improbable. In the extreme situation of the completely mixed 
state, p — |, all p/s are 4, so that n = here too, and the 
probability for violating the inequality vanishes. 



C. Asymptotic efficiency 

After detecting many qubits, we expect the ML estimator S 
to be quite close to the true Pauli vector s. If inequality ( 14.5b 
is obeyed, the average squared distance 



(As) 2 = (S-s) 
12 



S('-5>: 



N 



(4.21) 



is immediately available as a consequence of Eq. J4. 16b . 

The analysis is more involved when J4.5I is not obeyed. 
Let us consider once more the extreme situation of s being 
exactly opposite to one of the aj, such as s = —04, say, so 
that P4 = and v± = 0, and v\, U2, 1^3 differ little from 
pi = p2 = P3 = |. To leading order in Vj — |, the solution 
of Eq. ( 14.12b is then 



/' : 
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1\ 2 



(4.22) 



and the resulting ML estimator is 



1 3 3 3 

5 = -^(1 + 3^)^=5 + -^ 



,1/4 a,- 

2 3 3> 
3=1 



(4.23) 



In conjunction with Eq. J4.16I . we thus get the large- N ap- 
proximation 



(As) 2 = 3^ + ^ + 4 -1 = - 



(4.24) 



in this case of perfect anti-alignment between s and the quartet 
of dj 's. The sensitivity to small misalignments is discussed in 

Sec. lyTTcl 

As demonstrated by the numerical results of Fig. [5] these 
asymptotic approximations are actually quite reliable for N > 
1000. The plots report the mean values, with statistical error 
bars of one standard deviation, of the distance I S — s\ as ob- 
tained from 40 simulated experiments, whereby up to 6000 
qubits are detected in each run. The top plot refers to a true 
state that is completely mixed, so that s = in Eq. ( 14. 21b . In 
the bottom plot we have s = —d 4 and ( 14.24b applies. 

By comparison, for the standard six-output ellipsometer 
with counts n^± of the respective detectors, the ML estima- 
tor is 



5' 



l y+ 



(4.25) 
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FIG. 6: Mean value of the distance from the ML estimator S to the 
true Pauli vector s for 40 simulated experiments with up to 6000 
detected qubits per run. The top plot is for s = 0, the bottom plot 
for s — — £14. The solid lines indicate one standard deviation to each 
side of the mean value, as obtained from the large- N approximations 
for (As) 2 in Eqs. 14.211 and 14.241 . respectively. 



provided that the length of this S does not exceed unity. If it 
does, the search has to be constrained to unit vectors, much 
like in the discussion of Eqs. d4 . 6I >— d4~T4l > . When Eq. ( 14.25b 
applies, the error corresponding to ( 14.21b is given by 



9 -3s 2 
N 



(4.26) 



For s > 0, this is a bit smaller than ( 14.21b . so that the six- 
output scheme gives a slightly more reliable estimate. But 
when s gets close to unity, the four-output scheme can provide 
better estimates because there are states for which the error is 
substantially smaller, as the extreme situation of Eq. (14.24b 
shows. For the six-output ellipsometer, the privileged pure 
states p project on an eigenstate of a x , or a y , or <j z , so that 
one of the six detectors in Fig. |2]never registers a qubit. Then 
we have 



(As)' 



3N 



(4.27) 



which is a bit larger than its analog ( 14.24b . The main lesson 
is thus that the two schemes are of comparable asymptotic 
efficiency. 
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V. OPTIMALITY OF MINIMAL FOUR-STATE 
TOMOGRAPHY 

A. Cramer-Rao bound 

As a preparation of the discussion below on the optimality 
and efficiency of the minimal four-state tomography, we re- 
call some well known facts about state estimation. We regard 
the qubit state as parameterized by the three components of 
its Pauli vector s — (s x , s y ,s z ), rather than by the four prob- 
abilities pj, because the latter are constrained. Then, the error 
of the components of the ML estimator S can be estimated by 
the Cramer-Rao lower bound lfl7ll2lll . 



(A^) 2 > (J F 



with £ = x, y, z, 



where I-p is the Fisher information matrix, 



(*0 



d log C d log C 



(5.1) 



(5.2) 



and the averaging is done over the data, that is over the multi- 
nomial distribution that we exploited already in Sec. lIVBl 

There is no unique numerical measure for the comparison 
of the estimated state p cst = |(1 + S • (?) with the true state 
Ptruc = |(1 + s ■ a). One can make a case for the trace-class 
distance 



1 r, 



Ptruel} = 7j\S - sj , (5.3) 



for the Hilbert-Schmidt distance 



V 



HS 



[tr{(p e 



Ptr 



! }] 



1/2 



\S-s\, (5.4) 



for the Uhlmann fidelity 

U = tr{| v / ^ v / / o true |} 
1 



with 



1 + S-S+ v-/ 



1/2 



1 + 5 -s 



(5.5) 

1/2 



{S + s) 2 -(Sxs) 2 . 



(5.6) 



the two Pauli vectors. If one state is mixed, however, this is 
not true for the Uhlmann fidelity. For the sake of computa- 
tional simplicity, we opt for the (square of twice) the Hilbert- 
Schmidt distance, and thus quantify the measure of the es- 
timate by the size of (S — s) 2 , whose statistical average is 
already considered in Sec. IIVCI A convenient conservative 
estimate is given by the Cramer-Rao bound of Eq. d5.lt . 

According to Fisher's theorem 1 16] the ML estimator of p 
attains the Cramer-Rao bound for large N. Strictly speak- 
ing, this statement applies only to mixed states, because only 
for them the ML estimators are unbiased. For pure states, the 
positivity constraint plays an important role, and the Cramer- 
Rao bound derived for unbiased estimators tends to overes- 
timate the error, and therefore we shall consider pure states 
separately below. As a rule of thumb, the Cramer-Rao bound 
is reliable when the lion's share of the likelihood function is 
contained within the Bloch ball. This can, in fact, be used as 
an operational definition of "mixedness". Indeed, as the anal- 
ysis in Sec. lIV Bl shows. for mixed states there is no significant 
fraction of the likelihood function outside the Bloch ball if N 
is sufficiently large. 



B. Optimality of the tetrahedron geometry 

The four-state tomography with the tetrahedron geometry 
of Eqs. J2.5H2.9> is the best minimal qubit state measurement, 
in the sense that, among the measurements with four output 
channels, it provides the greatest accuracy. This can be seen 
as follows. 

For the multinomial statistics of Sec. lIVBl the Fisher infor- 
mation simplifies to 



_ N -y 1 d P j d Pj 



(5.8) 



where pj is the probabilities of detecting a qubit in the j-th 
output channel. Consider now this variational problem: For 
a given input state, the average distance D is to be mini- 
mized over all possible four-element POVMs. The functional 
in question is 



^ = Sp(/ F 1 )-A^n, 



(5.9) 



and for some more. In the single-qubit situation of present 
interest, there is no difference between T>ti and X>hs, but they 
are not the same for higher-dimensional systems, such as the 
qubit pairs of Sec. Ill CI We note in passing that the Uhlmann 
fidelity and the trace-class distance are natural pairs, inasmuch 
as they obey a fundamental inequality, 



w < 1 . 



(5.7) 



irrespective of the dimension of the Hilbert space. 

As long as one is comparing pure states with each other, the 
actual choice between the quantitative measures of Eqs. ( 15. 3> - 
i5.5\ does not matter much, because then all of them are 
monotonic functions of the length of the difference S — s of 



where Sp( ) is the trace ("spur") of the 3x3 matrix, and a 
Lagrange operator A takes care of the constraint J^j H = !• 
First, let us find the POVM that minimizes the functional 
d5.9i for the maximally mixed input state. In that case the 
extremal equations read 



RjUj = AUj for j = 1 



j . . . 5 -i 5 



where 



(5.10) 



(5.11) 
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involves the operator matrix Tj whose matrix elements are 
given by 



(5.12) 



Note the joint appearance of 3 x 3 Sp( ) traces and quantum- 
mechanical tr{ } traces. One verifies by inspection that any 
POVM of the tetrahedron geometry of Eqs. ( I2.5H2.9I satisfies 
these extremal equations and hence identifies the maximally 
mixed state with greatest accuracy. 

Obviously, for biased states the optimal POVM itself be- 
comes biased. The generalization of the extremal equations to 
this case is straightforward; the operators Tj will then contain 
one more term proportional to the true state p. Although an 
analytical solution may be difficult to obtain, one can always 
find the extremal measurement by an iterative procedure. As 
the input state is usually not known and selected in random, 
operators Rj should be averaged over the Bloch ball. In this 
way one obtains an algorithm providing the optimal sequential 
measurement, in the sense that the Hilbert-Schmidt distance is 
consistently reduced in each iteration step. Numerical results 
show that the tetrahedron POVM is also optimal for uniformly 
distributed input states. 

Quite explicitly, the optimal distance for the tetrahedron 
POVM is 



D, 



9 



opt 



N 



(5.13) 



in agreement with, or as a consequence of, the mean square 
distance of Eq. A4.2 II . As one would expect, the accuracy of 
the ellipsometer is somewhat better for pure input states than 
for mixed states. But, most importantly, the accuracy does 
not depend on the orientation of s relative to the measurement 
tetrahedron. 



C. Orientation of the measurement tetrahedron 

Their relative orientation is, however, not completely irrel- 
evant. For example, the value of k 2 in d4. 191 clearly depends 
on it. More generally, we can obtain the large- N mean values 
of functions of S — s from the asymptotic generating function 

exp^ir • (S — s)^ = exp( 



2N 



K 



(5.14) 



where the dyadic 



PjPkjak 



(5.15) 



depends on the positioning of s relative to the vector quartet 
of t he a£ s. This is, of course, an immediate consequence of 
Eq. Km . 

As an application, let us consider the asymptotic mean 
value of the Uhlmann fidelity of Eqs. J5.5I and \5.6\ . We first 
note that 



U = 1 



s) 2 



1-s 2 



(5.16) 



holds when S — s is small. Then we recall Eq. ( 14. 211 and 
extract 



• (S - 3 
from J4.25I to arrive at 

77= 1 - 



2 1 



72k 2 
N 



9-s 2 
8N 



9k 1 



N(l~s 2 ) 



(5.17) 



(5.18) 



where we meet the orientation-dependent quantity k 2 of 
Eq. KM . 

There are extremal orientations of three kinds. The value 
of k 2 is largest when s is parallel to one of the vectors of the 
tetrahedron quartet, and smallest when s is antiparallel. In 
addition to these four maxima and four minima of k 2 , there 
are also six saddle points that have s parallel to the sum of 
two different vectors of the quartet. The maximal and minimal 
values of k 2 are given by the upper and lower signs in 



(1±s)(3ts)s 2 
72 



for s = ±sctj (any j), (5.19) 



respectively, and the value at the saddle points is 

2 (3-s 2 )s 2 _ V3 ,^ _ v 

K = — for s = — s{a,j + a k ) (any j k). 

_ (5.20) 
Accordingly, the approach of U to unity is fastest for the s = 
— sdj orientation, for which the large- N approximation 



_ (3 + .s)(3 + 2.s) 



(5.21) 



applies. A small value of k is also advantageous in Eq. ( 14.181 1. 
as it ensures a large negative argument of the error function 
and thus a small probability for violating the inequality ( 14. 5> . 



VI. MEASURING PURE QUBIT STATES 

When the measured quantum system is known to be in a 
pure state — which is a bold over-idealization of any realistic 
situation — this knowledge can be exploited systematically 
when estimating the otherwise unknown state. A somewhat 
more realistic situation arises when the input state is pure but 
we do not know this to begin with, although this case is quite 
a bit artificial as well, because one can hardly assume that real 
sources are not affected by classical noise, or that the exper- 
imental setup is totally decoupled from the environment. Put 
differently, it is far-fetching to assume that the experimenter 
will ever have the perfect control that is necessary to ensure 
that a source emits a pure state. Nevertheless, there is an inter- 
est in such idealized scenarios because they come up in anal- 
yses of eavesdropping attacks on schemes for quantum cryp- 
tography, where — as a matter of principle — it is assumed 
that the eavesdropper is only limited by the laws of physics, 
not by practical limitations. 
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For s = 1, the Uhlmann fidelity (15. 5> simplifies to 



U 



S 



(6.1) 



available as a consequence of Eq. (14.16b . are used in 



and is equivalent to 



(6.2) 



if the estimator is a pure state as well, S = 1. When indeed 
estimating pure states with pure states, the situation to be con- 
sidered now, it is customary to take the average of U 2 , 



■ U 2 



(6.3) 



as the fidelity measure that judges the quality of the estimation 
procedure. One must keep in mind that both the true Pauli 
vector s and its estimator S are unit vectors here, as Eq. d6.3t 
applies only under this restriction. Knowing that the source 
generates pure states means having a lot of prior knowledge 
because the set of pure states is much smaller than the set of 
mixed states (the Bloch sphere rather than the Bloch ball), and 
so one can safely expect that a better, possibly much better, 
accuracy of the estimation can be achieved. 

The average fidelity of the optimal joint measurement on N 
copies is known to obey the inequality j22ll23ll 



— N - 

F < 



1 



iV + 2' 



(6.4) 



so that the corresponding error 1 — F will decrease as 1 /N in 
the large-TV limit. Any reasonable estimator should show this 
dependence. 

In our scheme the qubits are measured individually, not 
jointly. Nevertheless, as a consequence of the Fisher theo- 
rem in general, and of the findings in Sec. lIVCl in particular, 
the variance (As) 2 in Eq. (16. 3> is proportional to 1/N in the 
large- N limit, and so is then 1 — F, But we need to recon- 
sider the asymptotics of the maximum-likelihood estimation, 
now taking into account that both s and S are restricted to the 
Bloch sphere, so that we get the estimator S from Eqs. fl4. lOt — 
(14. 1 31 . whether inequality J4.5i is obeyed or not. 

Here, too, the estimation is sensitive to the orientation of the 
Pauli vector s*relative to the vector quartet of the measurement 
tetrahedron. We deal first with the case of "not anti-aligned," 
that is s 7^ — dj for all a/s. Then, Vj — pj and /i are of the 
order of 1/ vN and /i is given by 



with k 2 > from Eq. ( I4.19> . The resulting averages 



(6.5) 



9Nk 2 
2 

9Nk 2 



(3pj - l)pj , 



(6.6) 



to arrive at 

1-F 



2 

N ~ 3N 



N 9N 



p1-i 



(6.8) 



In particular, when the tetrahedron is aligned with the Pauli 
vector, the upper-sign case of ( 15.191 1 with s = 1, we have 



= 1/18 and 27 V p7 = 7/4, so that 



1 — Ft\ 



1 

N 



(6.9) 



for this parallel strategy ( j|). This is only half as big as what 
one would get for the s = 1 value of Eq. ( 14. 21L and thus 
demonstrates the advantage of estimating the pure true state 
by pure-state estimators only. 

There is no such advantage for the anti-parallel strategy 
(t|) that has the tetrahedron anti-aligned with s, because the 
argument in Sec. lIV Bl establishes that the ML estimator is al- 
ways a pure state then. Since this is now the lower-sign case 
of d5 . 1 91 for s = 1, we have k 2 = and Eqs. ( !6.5> -( l6~8l are 
not valid. Instead, we recall that Eqs. ( 14.22b — J4.24l > apply, and 
conclude that 



1 - Ft: = 



1 

2N 



(6.10) 



holds here. The anti-parallel strategy has thus half the average 
error of the aligned strategy. 

On the other hand, for a generic orientation of the input 
state there will always be an uncertainty in the length of the 
input state vector and therefore the fidelity will approach unity 
at a much slower rate proportional to 1/ >/~N. Only if one 
knows beforehand that the input state is pure, the 1 /N rate 
can be achieved even for general orientations of the tetrahe- 
dron. Let us illustrate this last remark by the example of the 
parallel strategy. For the parallel orientation, only about half 
of the measurement outcomes will violate inequality (14.5b . the 
others would not, see Sec. lIV Bl Knowing that the input state 
is pure one can, however, ignore this inequality and always 
solve Eqs. (14. 8I >— d4~T0l > for the input state unit Pauli vector. 



VII. ADAPTIVE STRATEGIES 
A. A pre-measurement strategy 

The good performance of the anti-parallel strategy hints at 
a very simple adaptive procedure that provides the fast 1/N 
asymptotic behavior without any prior knowledge about the 
purity of the input state: Let us split the input ensemble into 
two halves. After the first A^/2 particles have been registered 
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and the direction of the input Pauli vector estimated, the exper- 
imenter adopts the anti-parallel strategy for measuring the rest 
of the ensemble. Notice that in this simple adaptive scenario, 
the first half of the particles are used for a pre-measurement 
and serve only for adjusting the measurement apparatus, while 
it is the second half which provide the actual estimate of the 
input state. 

Let denote the angle between the Pauli vector s of the in- 
put state and the Pauli vector si estimated from the first half 
of the ensemble. This angle 8 can be estimated with an accu- 
racy of 1 — cos 6* cx 1/N in the first stage. This means that, 
in the second stage, the mean probability of detecting a par- 
ticle in the channel anti-parallel to s*i will be proportional to 
1/N. No matter how large is N, only a few particles will be 
detected in this channel. 

The maximal uncertainty in the length of the Pauli vector is 
then easily calculated with the aid of 

^ = ^£^-3 (7.1) 

3 

for the estimator of Eq. J4.2I . Let us set n\ = S with 6 a small 
number independent of N, and look for the minimal length of 
the estimated Pauli vector compatible with the given 6. Since 
the right-hand side of Eq. ( 17. 1> is a concave function of rii, 
713, and 714, it is minimal when all of them are equal to each 
other, 712 = n,3 = 714 = (N — S)/3, with the consequence 

4(5 

Smin = 1 - (7.2) 

for large N. This guarantees that in the second stage both the 
orientation and length of the Pauli vector, and so the fidelity, 
will be determined with an error proportional to 1/N. Nu- 
merical simulations show that for large N (up to N = 10°) 
the mean error 1 — F of this simple protocol is about twice as 
large as, and thus worse than, that of the optimal joint POVM 
measurement, in which all N qubits are measured together. 
Let us emphasize that while our protocol would provide this 
performance for any input pure state, it also provides a mean- 
ingful estimate for any input mixed state, of course, with a 
larger uncertainty proportional to 1/ yN. In contrast to that, 
the optimal joint measurement that attains the ultimate limit 
of Eq. J6.4i would not work for mixed input states. 

B. Self -learning strategies 

It is known that sequential measurements when combined 
with self-learning adaptive strategies can come close to the 
quantum estimation limit I24l 1251 [26]. Their improvement 
on the conventional sequential measurement depends on the 
purity of the input state l24ll . Adaptive techniques are more 
sensitive to pure states than to states with a lot of classical 
noise. 

The optical network of Fig. [3b) can easily be adapted to 
a self-learning procedure, and so can other optical implemen- 
tations. After each detection the current information about 
the input state can be evaluated, and the operations a x , a y , 



and a z acting on the next particle inside the interferometer 
can be modified by a common unitary transformation. This 
is economically achieved by performing the required unitary 
transformation on the approaching photon before it enters the 
interferometer proper. 

From the discussion in Sec. lVII one might get the impression 
that a particularly good adaptive strategy would be to always 
keep one of the measured half-projectors anti-parallel to the 
current estimate. Matters are, however, not so simple. 

Let us illustrate the difference between the adaptive and the 
non-adaptive procedure at the extreme example of measur- 
ing only two qubits. In the non-adaptive case, everything is 
as discussed above, in particular the probability that the first 
qubit is detected by the j-th detector and the second by the 
fc-th detector is PjPk with the p/s of Eq. J2. lQi and, in accor- 
dance with Sec. II VI the ML estimator is S = dj if j = k and 

S = yJ'i/A(dj + afe) if j 7^ k. Upon averaging (S — s) 2 first 
over all measurement results for a given input Pauli vector s 
and then over all possible inputs, we thus get 

^ _^2 = 5_V3 =10g9 (?3) 

as the figure of merit. 

In the adaptive case, the tetrahedron is realigned for the sec- 
ond qubit after the first qubit has been detected. Since the ML 
estimator S obtained after the detection of the first qubit will 
coincide with one of the tetrahedron vectors, the anti-aligning 
of the tetrahedron for the second qubit amounts to the replace- 
ment dj — > —dj. Now the probability that the first qubit is 
detected by the j-th detector and the second by the fc-th de- 
tector is C = Pj{\ — Pk)- The resulting ML estimator is then 
given by S = \/3/8(aj — dk), and upon averaging over all 
measurement results and all input Pauli vectors we get 



(S-s) 2 = 




= 1.017, (7.4) 



which is markedly smaller than the non-adaptive value in 
il.3\ . It is also smaller that the value for the adaptive strat- 
egy with parallel alignment because that is identical with the 
non-adaptive procedure when only two qubits are detected. 

We note that the averages in (17.31 and ( 17. 41 are taken over 
pure input states. If one averages over all input states, pure 
and mixed, one obtains the respective numbers (7 — v3) /5 = 
1.054 and (7- %/6)/5 = 0.910, again with a clear advantage 
for the anti-aligning adaptive strategy. 

C. Numerical simulations 

Although these numbers speak clearly in favor of the anti- 
aligning adaptive scheme, one should, however, keep in mind 
that they apply only for the exactly aligned or anti-aligned set- 
tings of the apparatus. But, in such an adaptive scheme, after 
the first particle is observed, the uncertainty of the input state 
Pauli vector orientation is still quite large, which may result in 
a significant misalignment in the second adaptive step. In fact, 
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FIG. 7: Mean estimation errors of the POVM tetrahedrons that, on 
the Bloch sphere, differ from the exact parallel (squares) and anti- 
parallel (circles) orientations by the known angle given on the ab- 
scissa. 




FIG. 8: For each value of k, 2 between ft 2 = (perfect anti- 
alignment) and k 2 = 1/18 (perfect alignment), the possible values 
of the coefficient of the 1/TV term in Eq. J6.8I are in the area bounded 
by the two curves. The smallest coefficient obtains for the case of 
perfect alignment, when Eq. 16.91 applies. 



the anti-aligning strategy has a greater sensitivity to such mis- 
alignments. This is illustrated by the simulation data shown in 
Fig.Q where the estimation errors of both strategies are shown 
in dependence on the misalignments of the apparatuses for the 
chosen input intensity of 10 4 particles. Each point has been 
obtained by averaging over 5 x 10 5 ML estimates. 

As expected, the anti-aligning adaptive strategy performs 
better if no misalignment is present. However, even a small 
misalignment (of the order of 1° in this case) is enough to 
wash out this advantage. For even larger misalignments the 
aligned setting provides much better performance ll27Tl . Dif- 
ferent sensitivities of both measurement strategies to this kind 
of error might be of quite some importance for the potential 
applications of the minimal ellipsometer in quantum commu- 
nication protocols and quantum cryptography. 

One can understand this extreme sensitivity of the anti- 
aligned setting, and why it becomes immediately worse than 
the aligned setting, by a second look at Eqs. d6.8l > and J6. 10b . 



The k 2 = result J6. 10b is not the k 2 — » 
result j6.8l >. In fact we have, see Fig.|SJ 



1 - F 



4 

3TV 



as k — > in Eq. (I6.8i . 



limit of the k 2 > 



(7.5) 



which is larger than the error of Ffj in Eq. ( I6.9i . Therefore, 
the slightest misalignment takes us from the 1/(2N) error of 
Eq. J6 . 1 Oi to this 4/(3 AT) because il.5i applies to tiny non- 
zero values of k 2 , while ( 16 . 1 Oi holds only if k 2 = exactly. 

This observation also resolves the apparent contradiction 
between the general upper bound of Eq. J6.4i and the large- A 
error for perfect anti-alignment in Eq. j6.10L which does not 
respect that upper bound. Nevertheless, this example is not 
a valid counterexample, because it refers to an absurdly arti- 
ficial situation: The experimenter has perfect a priori knowl- 
edge of the state to be measured and has perfect control over 
his measurement apparatus, such as to ensure the perfect anti- 
alignment to which Eq. J6. 10b applies. In other words, when 
it applies, there is no need for a state estimation to begin with. 

Having thus compared the performances of the two extreme 
strategies, we now calculate the mean fidelity of the follow- 
ing adaptive measurement: After detecting each new qubit 
the information about the input state is updated and a new 
ML estimate is calculated. Then one of the measured half- 
projectors is aligned along this current estimate. These two 
steps are repeated until all input particles are used up. Fig- 
ure [9] shows mean fidelities and errors that were obtained by 
averaging over 200'000 randomly selected pure input states. 
The quantum limit, Eq. ( 16. 4t . is shown for comparison. It is 
evident that this bound can be attained only for large TV, while 
the most pronounced difference is seen for moderately-sized 
ensembles. Such a behavior is typical for all sequential self- 
learning estimation strategies. 

Finally, let us compare the efficiency of the parallel adaptive 
strategy with a very simple sequential measurement where the 
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FIG. 9: Mean error (curves a, decreasing) and mean fidelity (curves 
b, increasing) of the minimal qubit tomography as a function of TV, 
the size of the measured ensemble. Solid lines: the proposed network 
of Figs.|5|and|4|with the adaptive parallel strategy described in the 
text; broken lines: the quantum limit of Eq. 16.41 . Notice that the 
proposed optical network attains the quantum limit asymptotically 
for large TV. 
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FIG. 10: Relative difference AF, in percent, between the errors of 
the random and parallel adaptive strategies (black dots); least-square 
linear fit (straight line). The lines connecting the dots guide the eye 
but have no further significance. The irregularities stem from the 
limited number of input states (200'000) used for the averaging. 



orientation of the measured half -projectors is chosen at ran- 
dom in each step, see Fig. [TO] As expected, the adaptive strat- 
egy is better and its benefit grows with increasing size of the 
measured ensemble. 



VIII. SUMMARY 

We have presented a minimal measurement scheme for 
single-qubit tomography that has no more than the necessary 
number of four outputs. The scheme is conceptually simple, 



highly symmetric and optimal among all four-output schemes, 
and can be realized with the present technology for the polar- 
ization qubit of photons emitted by a single-photon source. 
As a demonstration, we designed a simple, but not simplest, 
optical network. 

Our thorough analysis showed that the scheme is efficient 
in the sense that it enables one to estimate the qubit state re- 
liably without first detecting an enormous number of qubits 
— a few thousand are sufficient for most practical applica- 
tions, a few hundred may be enough if extreme precision is 
not required. The efficiency can be increased by adaptive pro- 
cedures in which the apparatus is adjusted in accordance with 
the current estimate for the qubit state. 

Since the four-output setup provides optimal complete to- 
mography with the minimal number of output channels, it is 
particularly well suited as a detection device for certain quan- 
tum communication protocols such as tomographic quantum 
cryptography j28tl . Indeed, there are protocols for quantum 
key distribution that exploit the tetrahedron quartet of states 
j^jjj , among them a highly efficient tomographic protocol 
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